directional information



Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds Learning

Hu, Chenyu, Li, Xiaotong, Zhu, Hao, Hou, Biao

arXiv.org Artificial Intelligence

Point cloud processing has become a cornerstone technology in many 3D vision tasks. However, arbitrary rotations introduce variations in point cloud orientations, posing a long-standing challenge for effective representation learning. The core of this issue is the disruption of the point cloud's intrinsic directional characteristics caused by rotational perturbations. Recent methods attempt to implicitly model rotational equivariance and invariance, preserving directional information and propagating it into deep semantic spaces. Yet, they often fall short of fully exploiting the multiscale directional nature of point clouds to enhance feature representations. To address this, we propose the Direction-Perceptive Vector Network (DiPVNet). At its core is an atomic dot-product operator that simultaneously encodes directional selectivity and rotation invariance--endowing the network with both rotational symmetry modeling and adaptive directional perception. At the local level, we introduce a Learnable Local Dot-Product (L2DP) Operator, which enables interactions between a center point and its neighbors to adaptively capture the non-uniform local structures of point clouds. At the global level, we leverage generalized harmonic analysis to prove that the dot-product between point clouds and spherical sampling vectors is equivalent to a direction-aware spherical Fourier transform (DASFT). This leads to the construction of a global directional response spectrum for modeling holistic directional structures. We rigorously prove the rotation invariance of both operators. Extensive experiments on challenging scenarios involving noise and large-angle rotations demonstrate that DiPVNet achieves state-of-the-art performance on point cloud classification and segmentation tasks. Our code is available at https://github.com/wxszreal0/DiPVNet.
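The atomic property this abstract builds on can be checked in a few lines: the dot product of two vectors is unchanged when both are rotated by the same rotation. A minimal sketch (illustrative only, not the DiPVNet code):

```python
import numpy as np

def random_rotation(rng):
    # QR decomposition of a random matrix gives an orthogonal matrix;
    # flip a column if needed so det = +1 (a proper rotation).
    q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
    if np.linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]
    return q

rng = np.random.default_rng(0)
R = random_rotation(rng)
x = rng.normal(size=3)
y = rng.normal(size=3)

# <x, y> == <Rx, Ry> for any rotation R: the dot product is rotation-invariant
# while still encoding the relative direction between x and y.
assert np.isclose(x @ y, (R @ x) @ (R @ y))
```

Because the value depends on the angle between the two vectors, dot-product features stay direction-selective even though they are invariant to a global rotation of the whole point cloud.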


Directional Sheaf Hypergraph Networks: Unifying Learning on Directed and Undirected Hypergraphs

Mule, Emanuele, Fiorini, Stefano, Purificato, Antonio, Siciliano, Federico, Coniglio, Stefano, Silvestri, Fabrizio

arXiv.org Artificial Intelligence

Hypergraphs provide a natural way to represent higher-order interactions among multiple entities. While undirected hypergraphs have been extensively studied, the case of directed hypergraphs, which can model oriented group interactions, remains largely under-explored despite its relevance for many applications. Recent approaches in this direction often exhibit an implicit bias toward homophily, which limits their effectiveness in heterophilic settings. Rooted in the algebraic topology notion of Cellular Sheaves, Sheaf Neural Networks (SNNs) were introduced as an effective solution to circumvent such a drawback. While a generalization to hypergraphs is known, it is only suitable for undirected hypergraphs, failing to tackle the directed case. In this work, we introduce Directional Sheaf Hypergraph Networks (DSHN), a framework integrating sheaf theory with a principled treatment of asymmetric relations within a hypergraph. From it, we construct the Directed Sheaf Hypergraph Laplacian, a complex-valued operator by which we unify and generalize many existing Laplacian matrices proposed in the graph- and hypergraph-learning literature. Across 7 real-world datasets and against 13 baselines, DSHN achieves relative accuracy gains from 2% up to 20%, showing how a principled treatment of directionality in hypergraphs, combined with the expressive power of sheaves, can substantially improve performance.
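The paper's Directed Sheaf Hypergraph Laplacian is not reproduced here, but the simpler magnetic Laplacian for a directed graph, one of the complex-valued Laplacians such frameworks unify, shows the core idea: symmetric magnitudes carry the connection strength while a complex phase encodes edge direction. A hedged sketch:

```python
import numpy as np

def magnetic_laplacian(A, q=0.25):
    """A: directed adjacency (n x n); q: charge parameter controlling
    how strongly direction is encoded in the phase."""
    A_sym = (A + A.T) / 2.0               # symmetrized edge weights
    theta = 2.0 * np.pi * q * (A - A.T)   # antisymmetric phase = direction
    D = np.diag(A_sym.sum(axis=1))
    return D - A_sym * np.exp(1j * theta)

# A directed 3-cycle: 0 -> 1 -> 2 -> 0
A = np.array([[0, 1, 0],
              [0, 0, 1],
              [1, 0, 0]], dtype=float)
L = magnetic_laplacian(A)

# Hermitian, so eigenvalues are real; and positive semi-definite.
assert np.allclose(L, L.conj().T)
assert (np.linalg.eigvalsh(L) >= -1e-10).all()
```

Hermitianity is what makes such operators usable for spectral learning despite the asymmetry of the underlying relations.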


S$^2$FS: Spatially-Aware Separability-Driven Feature Selection in Fuzzy Decision Systems

Xu, Suping, Dai, Chuyi, Liu, Ye, Shang, Lin, Yang, Xibei, Pedrycz, Witold

arXiv.org Artificial Intelligence

Feature selection is crucial for fuzzy decision systems (FDSs), as it identifies informative features and eliminates rule redundancy, thereby enhancing predictive performance and interpretability. Most existing methods either fail to directly align evaluation criteria with learning performance or rely solely on non-directional Euclidean distances to capture relationships among decision classes, which limits their ability to clarify decision boundaries. However, the spatial distribution of instances has a potential impact on the clarity of such boundaries. Motivated by this, we propose Spatially-aware Separability-driven Feature Selection (S$^2$FS), a novel framework for FDSs guided by a spatially-aware separability criterion. This criterion jointly considers within-class compactness and between-class separation by integrating scalar-distances with spatial directional information, providing a more comprehensive characterization of class structures. S$^2$FS employs a forward greedy strategy to iteratively select the most discriminative features. Extensive experiments on ten real-world datasets demonstrate that S$^2$FS consistently outperforms eight state-of-the-art feature selection algorithms in both classification accuracy and clustering performance, while feature visualizations further confirm the interpretability of the selected features.
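The forward greedy strategy described above can be sketched with a stand-in criterion. The score below is a plain Fisher-style ratio (between-class separation over within-class compactness), not the paper's spatially-aware criterion, which additionally integrates directional information; only the greedy loop mirrors the described procedure:

```python
import numpy as np

def separability(X, y, feats):
    # Between-class scatter over within-class scatter on the chosen features.
    Xs = X[:, feats]
    classes = np.unique(y)
    mu = Xs.mean(axis=0)
    between = sum((y == c).sum() * np.sum((Xs[y == c].mean(axis=0) - mu) ** 2)
                  for c in classes)
    within = sum(np.sum((Xs[y == c] - Xs[y == c].mean(axis=0)) ** 2)
                 for c in classes)
    return between / (within + 1e-12)

def forward_select(X, y, k):
    # Greedily add the feature that most improves the criterion.
    chosen, remaining = [], list(range(X.shape[1]))
    for _ in range(k):
        best = max(remaining, key=lambda f: separability(X, y, chosen + [f]))
        chosen.append(best)
        remaining.remove(best)
    return chosen

# Feature 0 separates the two classes; feature 1 is pure noise.
rng = np.random.default_rng(0)
X = np.c_[np.r_[rng.normal(0, 1, 50), rng.normal(6, 1, 50)],
          rng.normal(0, 1, 100)]
y = np.r_[np.zeros(50), np.ones(50)]
assert forward_select(X, y, 1) == [0]
```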



What does really matter in image goal navigation?

Monaci, Gianluca, Weinzaepfel, Philippe, Wolf, Christian

arXiv.org Artificial Intelligence

Image goal navigation requires two different skills: firstly, core navigation skills, including the detection of free space and obstacles, and taking decisions based on an internal representation; and secondly, computing directional information by comparing visual observations to the goal image. Current state-of-the-art methods either rely on dedicated image matching, or on pre-training of computer vision modules on relative pose estimation. In this paper, we study whether this task can be efficiently solved with end-to-end training of full agents with RL, as has been claimed by recent work. A positive answer would have impact beyond Embodied AI and allow training of relative pose estimation from reward for navigation alone. In this large experimental study we investigate the effect of architectural choices like late fusion, channel stacking, space-to-depth projections and cross-attention, and their role in the emergence of relative pose estimators from navigation training. We show that the success of recent methods is influenced to a certain extent by simulator settings, leading to shortcuts in simulation. However, we also show that these capabilities transfer to more realistic settings, to some extent. We also find evidence for correlations between navigation performance and probed (emerging) relative pose estimation performance, an important sub-skill.
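Two of the architectural choices the study compares differ mainly in where the observation and the goal image meet. A shape-level sketch with stand-in encoders (names and the trivial `encode` are illustrative, not the authors' code):

```python
import numpy as np

H = W = 64
obs  = np.zeros((3, H, W), dtype=np.float32)   # current RGB observation
goal = np.zeros((3, H, W), dtype=np.float32)   # goal image

# Channel stacking: concatenate the images before the encoder, so early
# layers can compare observation and goal pixel-to-pixel, a plausible
# substrate for relative pose cues to emerge.
stacked = np.concatenate([obs, goal], axis=0)
assert stacked.shape == (6, H, W)

def encode(img):
    # Stand-in for a CNN/ViT backbone producing a feature vector.
    return img.mean(axis=(1, 2))

# Late fusion: encode each image separately, then fuse the embeddings;
# the encoder itself never sees the two images jointly.
fused = np.concatenate([encode(obs), encode(goal)])
assert fused.shape == (6,)
```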


Ignoring Directionality Leads to Compromised Graph Neural Network Explanations

Sun, Changsheng, Li, Xinke, Dong, Jin Song

arXiv.org Artificial Intelligence

Graph Neural Networks (GNNs) have emerged as a powerful tool for modeling relational data in applications such as financial fraud detection [1], [2] and social network analysis [3]. As GNNs are increasingly deployed in safety-critical domains where their decisions impact human lives and societal well-being [4], [5], ensuring their trustworthiness has become essential. Unlike traditional software systems, where correctness can often be ensured through formal verification [6], [7], deep learning models--including GNNs--function as black boxes, making it difficult to validate their decisions. To address this, explainability has become essential for deploying GNNs in real-world decision-making pipelines. Recently, post-hoc explanation methods such as GNNExplainer [8] and PGExplainer [9] have become widely used to enhance user trust, facilitate model debugging for developers, and provide external validation for regulatory compliance for these black-box GNN models. A useful analogy can be drawn between explaining convolutional neural networks (CNNs) and GNNs. As shown in Figure 1, CNN explainability methods such as Grad-CAM [10] highlight key image regions influencing a prediction--e.g., focusing on a dog's face to classify it as "dog." Similarly, GNN explainers identify critical subgraph structures affecting predictions.
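The compromise the title warns about can be seen in one propagation step. A minimal illustration, not tied to any particular explainer implementation: on a directed graph, aggregation with the true adjacency differs from aggregation on the symmetrized graph, so an explanation computed after discarding edge directions can attribute influence along edges that do not exist.

```python
import numpy as np

A = np.array([[0., 1., 0.],   # directed edges: 0 -> 1, 1 -> 2
              [0., 0., 1.],
              [0., 0., 0.]])
X = np.eye(3)                 # one-hot node features

directed = A @ X              # aggregation respecting edge direction
undirected = (A + A.T) @ X    # symmetrization adds reverse influence

# The two propagation results disagree wherever a reverse edge was invented.
assert not np.allclose(directed, undirected)
assert directed[2, 1] == 0.0       # node 2 receives nothing from 1 -> 2's source...
assert undirected[2, 1] == 1.0     # ...until symmetrization fabricates 2 -> 1 flow
```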


Reviews: Connective Cognition Network for Directional Visual Commonsense Reasoning

Neural Information Processing Systems

Originality: The paper proposes a novel model for the recently introduced VCR task. The main novelty of the proposed model lies in the GraphVLAD component and the directional GCN modules. The paper notes that one of the closest works is that of Narasimhan et al. (NeurIPS 2018), which used a GCN to infer answers in VQA; however, that work constructs an undirected graph, ignoring the directional information between graph nodes. This paper instead uses a directed graph and shows the usefulness of incorporating directional information. It would be good for this paper to include more related work on the GraphVLAD front. Quality: The paper evaluates the proposed approach on the VCR dataset and compares with the baselines and previous state-of-the-art, demonstrating how the proposed work improves the previous best performance significantly.


Molecule Graph Networks with Many-body Equivariant Interactions

Mao, Zetian, Li, Jiawen, Liang, Chen, Das, Diptesh, Sumita, Masato, Tsuda, Koji

arXiv.org Artificial Intelligence

In recent years, machine learning (ML) models have shown great success in materials science by accurately predicting quantum properties of atomistic systems several orders of magnitude faster than ab initio simulations [1]. These ML models have practically assisted researchers in developing novel materials across various fields, such as fluorescent molecules [2] and electret polymers [3]. Graph neural networks (GNNs) [4, 5] are particularly notable among ML models for atomic systems because molecules lend themselves naturally to 3D graph representations in which each atom is characterized by its 3D Cartesian coordinate. The 3D molecular information, such as bond lengths and angles, is crucial for model learning [6, 7, 8]. However, these rotationally invariant representations may lack directional information, causing the model to view distinct structures as identical [9, 10].
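The failure mode in the last sentence has a concrete instance: two mirror-image point sets share every pairwise distance, so a model built only on distances cannot separate them, while a signed, direction-aware quantity such as the scalar triple product can. An illustrative sketch (toy coordinates, not a real molecule):

```python
import numpy as np
from itertools import combinations

P = np.array([[0., 0., 0.],
              [1., 0., 0.],
              [0., 1., 0.],
              [0., 0., 1.]])
Q = P * np.array([1., 1., -1.])   # reflection through the xy-plane

def dists(X):
    # Sorted multiset of pairwise distances, a purely invariant descriptor.
    return sorted(np.linalg.norm(X[i] - X[j])
                  for i, j in combinations(range(len(X)), 2))

# Reflection is an isometry: the distance descriptors are identical.
assert np.allclose(dists(P), dists(Q))

def triple(X):
    # Signed volume spanned by the three edges leaving atom 0;
    # flips sign under reflection, so it carries directional information.
    return np.linalg.det(X[1:] - X[0])

assert np.isclose(triple(P), -triple(Q)) and triple(P) != 0
```

This is exactly the chirality case: distance- and angle-only features collapse the two enantiomers, which motivates equivariant, direction-carrying interactions.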


Directional Connectivity-based Segmentation of Medical Images

Yang, Ziyun, Farsiu, Sina

arXiv.org Artificial Intelligence

Anatomical consistency in biomarker segmentation is crucial for many medical image analysis tasks. A promising paradigm for achieving anatomically consistent segmentation via deep networks is incorporating pixel connectivity, a basic concept in digital topology, to model inter-pixel relationships. However, previous works on connectivity modeling have ignored the rich channel-wise directional information in the latent space. In this work, we demonstrate that effective disentanglement of directional sub-space from the shared latent space can significantly enhance the feature representation in the connectivity-based network. To this end, we propose a directional connectivity modeling scheme for segmentation that decouples, tracks, and utilizes the directional information across the network. Experiments on various public medical image segmentation benchmarks show the effectiveness of our model as compared to the state-of-the-art methods. Code is available at https://github.com/Zyun-Y/DconnNet.
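The digital-topology notion the abstract builds on, pixel connectivity, can be made concrete with a small sketch (illustrative only, not the DconnNet model): for a binary mask, one connectivity map per direction marks pixels whose neighbor in that direction is also foreground.

```python
import numpy as np

def shift(mask, dy, dx):
    # Neighbor value in direction (dy, dx), zero-padded at the border.
    padded = np.pad(mask, 1)
    H, W = mask.shape
    return padded[1 + dy:1 + dy + H, 1 + dx:1 + dx + W]

# The 8-neighborhood directions of digital topology.
DIRS = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
        (0, 1), (1, -1), (1, 0), (1, 1)]

def connectivity_maps(mask):
    """(H, W) binary mask -> (8, H, W): a pixel is 1 in map d iff it and
    its neighbor in direction d are both foreground."""
    return np.stack([mask & shift(mask, dy, dx) for dy, dx in DIRS])

mask = np.array([[0, 1, 1],
                 [0, 1, 0],
                 [0, 0, 0]])
maps = connectivity_maps(mask)
# Pixel (0, 1) is connected to its right neighbor (direction (0, 1))...
assert maps[DIRS.index((0, 1))][0, 1] == 1
# ...and to its lower neighbor (direction (1, 0)).
assert maps[DIRS.index((1, 0))][0, 1] == 1
```

Modeling these per-direction channels separately, rather than a single undirected connectivity, is the kind of directional structure the paper's scheme tracks through the network.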